Environment Adaptation and Long Term Parameters in Speaker Identification

Authors

  • Chakib Tadj
  • Pierre Dumouchel
  • Mohamed Mihoubi
  • Pierre Ouellet
Abstract

In this paper, we integrate two different techniques into a GMM-based speaker identification system: (a) a Maximum Likelihood Linear Regression (MLLR) transformation, which adapts the system to a new environment by modifying the continuous densities of the GMM mixtures. We apply MLLR to perform environmental compensation by reducing the mismatch due to channel or additive noise effects; (b) Linear Discriminant Analysis (LDA) applied to sequences of acoustic vectors. LDA extracts from these sequences a set of discriminant parameters that maximize class separability by means of a linear transformation. Previous work has shown that applying LDA to speech recognition improves the performance of speech recognition systems. We use this approach to extract features that are more invariant to non-speaker-related conditions such as handset type and channel effects. Experiments are conducted on the 45-speaker SPIDRE database.
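As a rough illustration of the MLLR idea described in the abstract, the sketch below applies an affine transform of the form mu' = A mu + b to the component means of diagonal-covariance GMMs and then identifies the speaker by maximum average log-likelihood. It is a toy example and not the authors' implementation: the model layout, the function names, the speaker labels, and the values of A and b are all assumptions made for illustration. In MLLR proper, A and b would be estimated from adaptation data by maximum likelihood; here they are simply given, and one global transform is shared by all speaker models.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def gmm_avg_loglik(frames, gmm):
    """Average per-frame log-likelihood under a GMM.

    gmm is a list of (weight, mean, var) tuples (hypothetical layout).
    """
    total = 0.0
    for x in frames:
        comps = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
        top = max(comps)  # log-sum-exp for numerical stability
        total += top + math.log(sum(math.exp(c - top) for c in comps))
    return total / len(frames)

def mllr_adapt_means(gmm, A, b):
    """Apply the affine transform mu' = A mu + b to every component mean.

    Weights and variances are left unchanged (mean-only adaptation).
    """
    adapted = []
    for w, mean, var in gmm:
        new_mean = [
            sum(a_ij * m_j for a_ij, m_j in zip(row, mean)) + b_i
            for row, b_i in zip(A, b)
        ]
        adapted.append((w, new_mean, var))
    return adapted

def identify(frames, speaker_gmms):
    """Return the speaker whose GMM scores the frames highest."""
    return max(speaker_gmms, key=lambda s: gmm_avg_loglik(frames, speaker_gmms[s]))

# Toy speaker models trained in a "clean" environment (made-up numbers).
speakers = {
    "spk_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [2.0, 2.0], [1.0, 1.0])],
    "spk_b": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [7.0, 7.0], [1.0, 1.0])],
}

# Test frames from spk_a observed through a channel that shifts features by about +3.
test_frames = [[3.1, 2.9], [5.0, 5.2], [2.8, 3.0], [5.1, 4.9]]

# Assumed transform for the new environment (identity rotation plus a bias).
A = [[1.0, 0.0], [0.0, 1.0]]
b = [3.0, 3.0]
adapted = {s: mllr_adapt_means(g, A, b) for s, g in speakers.items()}

before = identify(test_frames, speakers)  # mismatched models
after = identify(test_frames, adapted)    # environment-compensated models
print(before, after)
```

With these toy numbers the mismatched models pick the wrong speaker, while the adapted models recover the correct one, which is the kind of channel mismatch the abstract says MLLR is meant to reduce.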


Similar resources


Bispectrum features for robust speaker identification

Along with the spoken message, speech contains information about the identity of the speaker. Thus, the goal of speaker identification is to develop features which are unique to each speaker. This paper explores a new feature for speech and shows how it can be used for robust speaker identification. The results will be compared to the cepstrum feature due to its widespread use and success in spea...


Speech compression with preservation of speaker identity

Although much effort has been directed recently towards speech compression at rates below 4 kb/s, the primary metric for comparison has, understandably, been the amount of spectral distortion in the decompressed speech. However, an aspect which is becoming important in some applications is the ability to identify the original speaker from the coded speech algorithmically. We investigate here the...


Using maximum likelihood linear regression for segment clustering and speaker identification

Many adaptation scenarios rely on clustering of either the test or training data. Although consistency between the clustering and adaptation objective functions is desired, most previous approaches have not implemented such consistency. This paper shows that the statistics used in Maximum Likelihood Linear Regression (MLLR) adaptation are sufficient to cluster data with a consistent Maximum Likel...


Selective use of the speech spectrum and a VQGMM method for speaker identification

This paper describes two separate sets of speaker identification experiments. In the first set of experiments, the speech spectrum is selectively used for speaker identification. The results show that the higher portion of the speech spectrum contains more reliable idiosyncratic information on speakers than does the lower portion of equal bandwidth. In the second set of experiments, a vector-quanti...




Publication date: 1999